A Hybrid Model for Enhancing Lexical Statistical Machine Translation (SMT)

نویسندگان

  • Ahmed G. M. ElSayed
  • Ahmed S. Salama
  • Alaa El-Din M. El Ghazali
چکیده

The interest in statistical machine translation systems increases currently due to political and social events in the world. A proposed Statistical Machine Translation (SMT) based model that can be used to translate a sentence from the source Language (English) to the target language (Arabic) automatically through efficiently incorporating different statistical and Natural Language Processing (NLP) models such as language model, alignment model, phrase based model, reordering model, and translation model. These models are combined to enhance the performance of statistical machine translation (SMT). Many implementation tools have been used in this work such as Moses, Gizaa++, IRSTLM, KenLM, and BLEU. Based on the implementation, evaluation of this model, and comparing the generated translation with other implemented machine translation systems like Google Translate, it was proved that this proposed model has enhanced the results of the statistical machine translation, and forms a reliable and efficient model in this field of research.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Hybrid Machine Translation System Based on a Monotone Decoder

In this paper, a hybrid Machine Translation (MT) system is proposed by combining the result of a rule-based machine translation (RBMT) system with a statistical approach. The RBMT uses a set of linguistic rules for translation, which leads to better translation results in terms of word ordering and syntactic structure. On the other hand, SMT works better in lexical choice. Therefore, in our sys...

متن کامل

A Hybrid System for Chinese-English Patent Machine Translation

This paper presents a novel hybrid system, which combines rule-based machine translation (RBMT) with phrase-based statistical machine translation (SMT), to translate Chinese patent texts into English. The hybrid architecture is basically guided by the RBMT engine which processes source language parsing and transformation, generating proper syntactic trees for the target language. In the generat...

متن کامل

Hybrid Architectures for Multi-Engine Machine Translation

We describe different architectures that combine rule-based and statistical machine translation (RBMT and SMT) engines into hybrid systems. One of them allows to combine many existing MT engines in a multi-engine setup, which can be done under the control of a decoder for SMT. Another architecture uses lexical entries induced via SMT technology to be included in a rule-based system. For all the...

متن کامل

Can SMT and RBMT Improve each other's Performance?- An Experiment with English-Hindi Translation

Rule-based machine translation (RBMT) and Statistical machine translation (SMT) are two well-known approaches for translation which have their own benefits. System architecture of SMT often complements RBMT, and the vice-versa. In this paper, we propose an effective method of serial coupling where we attempt to build a hybrid model that exploits the benefits of both the architectures. The first...

متن کامل

Hybrid Machine Translation Architectures within and beyond the EuroMatrix project

This paper presents two hybrid architectures combining rulebased and statistical machine translation (RBMT and SMT) technology. In the first case, several existing MT engines are combined in a multiengine setup, and a decoder for SMT is used to select and combine the best expressions proposed by different engines. The other architecture uses lexical entries found using a combination of SMT tech...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1506.01171  شماره 

صفحات  -

تاریخ انتشار 2015